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(57) Abstract 

An apparatus for communicating video frames from a source (102) having a video input means (1 10) to a destination (202) having a 
visual display unit (204) during the course of a telephone conversation comprises: first and second telephones (122, 222) connectable to a 
telephone network (8); an audio-video service provider (6) that can communicate over the network (8) with both telephones along first and 
second communication paths (130, 230); first and second computers (102. 262) in proximity with respective telephones (122. 222), at least 
one of the computers having a video input means (1 10) and each computer having a data input means (106. 206). a visual display unit (104, 
204). and communication means so that the computers can connect to the service provider respectively along third and fourth communication 
paths (1 16. 216). A first user can use the first computer communication means to make a first connection to the service provider (6). The 
service provider or first user then initiates a telephone call from the first to the second telephones (122. 222) via the service provider (6) 
and the first and second communication paths (130. 230). A second user can then use the second computer communication means to make 
a second connection to the service provider (6). Caller-id data is communicated between each of the first and second users and the service 
provider (6) so that the service provider can correlate both the call and the first and second connections with each other. At least one video 
image is uploaded from the video input means (1 10) to the service provider (6). and then downloaded to one of the computers. 
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AUDIO-VIDEO CONFERENCE SYSTEM WITH PARALLEL NETWORKS 

The present invention relates to a method and apparatus 
for communicating one or more video frames from a source 
5 having a video input means to a destination having a 
visual display unit during the course of a telephone 
conversation. 

Known multimedia communication terminals are integrated 
10 devices, that is, they contain both audio and video 
- communication portions linked, together with internal 
circuitry and/or software which synchronise audio and 
video digital data or analogue signals (herein referred to 
collectively as 'signals"), and hence corresponding 
15 audible and visible output from the integrated terminal. 
Such synchronisation is particularly desirable in order to 
achieve so-called 'lip- synch" in audio-video telephony. In 
practice, for the signals to be effectively synchronised, 
it has been found in the broadcast industry that the 
20 visible output should lead the audible output by no more 
than about 20 ms, and should lag the audio signal by no 
more than about 40 ms. 

Examples of known integrated terminals operating according 
25 to the ITU-T Recommendation H.320 'Narrow-band Visual 
Telephone Systems and Terminal Equipment" include those 
sold by PictureTel Corporation under the trade mark the 
Venue -2 000 and those sold by VTEL Corporation under the 
trade mark Enterprise Series Room System TCIOOO . Intel 

30 Corporation sells a business video conferencing system 
under the product code PCVD1013ST that operates according 
to the H.320 and H.323 standards. An example of a known 
integrated .terminal operating according to the H.324 
standard is those sold by 8x8 Inc. under the trade mark 

35 ViaTV Phone. 
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Such known terminals are designed to function in an audio- 
only mode, so that these can function as a simple 
telephone when communicating with another telephony 
terminal . 

5 

Such integrated devices have yet to become widely adopted, 
and one reason for this is that whilst such devices may 
function as a telephone, users still need a conventional 
telephone for communication, for example with other 

10 telephones on the same PBX exchange, or with external 
telephones. Many PBX manufacturers now support ISDN lines 
to the desktop for the provision of multimedia 
communication terminals, according to the H.320 standard. 
Unfortunately, different manufacturers provide different 

15 levels of functionality (for example, features such as 
call hold, call transfer and call forward) , not all of 
which are supported by multimedia terminals. Again, the 
result is the need to have more than one telephony device 
on a desktop. 

20 

Most users therefore end up with two telephony devices on 
their desktop. This is inconvenient, owing to the extra 
desktop space normally required, as well as the need to 
have a different telephone number for each device. Callers 
25 must therefore keep track of two numbers, and decide in 
advance which type of call they intend to place. 

The recent emergence of the H.323 standard for multimedia 
communications over packet networks, for example local 

30 area networks (LANs) using the Internet Protocol over 
Ethernet, has added further complications for the user, 
because the multimedia terminal must then in most cases 
connect to the data network, rather than the telephony 
network. It is well known that the typical data network is 

35 not as resilient or reliable as the telephony network. 
Many H.323 multimedia terminals are PC based, the user 
rightly fears losing his telephony facility when the LAN 
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or his PC crashes or fails. Therefore, the user still 
needs more than one telephony device. 

Furthermore, a critical mass of audio-video - equipment does 
not yet exist. Users and potential users of audio-video 
telephony are therefore unable to communicate using audio - 
video telephony with a large number of people whom they 
may call. This further inhibits the adoption of audio- 
video telephony. 

Another problem concerns security arrangements such as 
firewalls for computers on data networks, for example 
LAN's and Intranets, in order to prevent unauthorised data 
from entering or leaving the network. Although firewalls 
generally permit activities such as the receipt and 
sending of e-mail messages or the browsing of web pages, 
these firewalls are not normally compatible with packet- 
based audio-video communication using the data network 
such as those conforming to H.323 or SIP (Session 
Initiation Protocol) . Therefore, even if two users had the 
requisite audio-video terminals, they would not be able to 
communicate if on opposite sides of the firewall. Although 
special firewalls (e.g. H.323) can in principle be used to 
allow such communication, network managers may be 
unwilling to invest in such special equipment simply 
because the demand for audio-video communication is too 
low. 

It is an object of the present invention to provide a more 
convenient audio and video communication method and system 
in cases where it is not necessary for video to be 
synchronised with the audio in the course of an audio- 
video telephone call so that both parties need not be 
equipped with fully capable multimedia terminals. 

According to the invention, there is provided an audio and 
video communication system, comprising: a first audio 
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telephony device and a second audio telephony device that 
may cotmnunicate with each other over a telephony 
communications network; an audio-video service provider 
that may communicate over said telephony communications 
5 network with both the first and second telephony devices 
respectively along a first communication path and a second 
communication path; a first computer and a second 
computer, said computers being in proximity with 
respectively the first telephony device and the second 
10 telephony device, at least one of the computers having a 
- video input means and the first computer and second 
computer each having a data input means, a visual display 
unit, and communication means by which the first computer 
and second computer may make a connection to the service 
15 provider respectively along a third communication path and 
a fourth communication path; in which the first telephony 
device and first computer are operable by a first user and 
the second telephony device and second computer are 
operable by a second user, wherein the system is capable 

20 of: 

i) permitting the first user to use the first computer 
to make a first connection to the service provider; 

25 ii) permitting the initiation of an audio telephony call 
between the first telephony device and the second 
telephony device via the service provider along the first 
and second communication paths ; 



30 



35 



iii) communicating caller-id data between the first user 
and the service provider along the first communication 
path and/or the third communication path so that the 
service provider can correlate both the call and the first 
connection with each other; 

iv) uploading at least one video image from the video 
input means to the service provider; 
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V) permitting the second user to use the second computer 
to make a second connection to the service provider; 

vi) communicating caller- id data between the second user 
and the service provider along at least the fourth 
communication path so that the service provider can 
correlate both the call and the second connection with 
each other; 



vii) downloading said video image (s) from the service 
provider to the first computer and/or second computer to 
be displayed on the respective video display unit(s) when 
the second user has answered said telephone call and when 
15 both the first and second connections are correlated with 
the call . 

Also according to the invention, there is provided a 
method of making an audio-video telephone call using an 
20 audio and video communication system that comprises: a 
first telephony device and a second telephony device that 
may communicate with each other over a telephony 
communications network; an audio -video service provider 
that may communicate over said telephony communications 
25 network with both the first and second telephony devices 
respectively along a first communication path and a second 
communication path; a first computer and a second 
computer, said computers being in proximity with 
respectively the first telephony device and the second 

30 telephony device, at least one of the computers having a 
video input means and the first computer and second 
computer each having a data input means, a visual display 
unit, and communication means by which the first computer 
and second computer may make a connection to the service 

35 provider respectively along a third communication path and 
a fourth communication path; in which the first telephony 
device and first computer are operable by a first user and 



wo 00/45579 



PCT/GBOO/00203 



- 6 - 

the second telephony device and second computer are 
operable by a second user, wherein the method comprises 
the steps of: 

5 a) using the first computer communication means to make 
a first connection to the service provider; 

b) initiating an audio telephone call between the first 
telephone and the second telephone via the service 

10 provider and the first and second communication paths; 

c) communicating caller-id data between the first user 
and the service provider along the first communication 
path and/or the third communication path so that the 

15 service provider can correlate both the call and the first 
connection with each other; 

d) uploading at least one video image from the video 
input means to the service provider; 

20 

e) using the second computer communication means to make 
a second connection to the service provider; 



f) communicating caller- id data between the second user 
25 and the service provider along at least the fourth 

communication path so that the service provider can 
correlate both the call and the second connection with 
each other; 

g) downloading said video image (s) from the service 
provider to the first computer and/or second computer to 
be displayed on the respective video display unit(s) when 
the second user has answered said telephone call and when 
both the first and second connections are correlated with 
the call . 



30 



35 



10 
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The video image (s) may be uploaded and/or downloaded via 

the third communication path and/or the fourth 
communication path. 

i Alternatively, if the communication means includes means 
by which the first computer and second computer may make a 
connection to the service provider respectively along a 
fifth communication path and a sixth communication path, 
the method may coti^rise the steps of : 

- h) communicating to the service provider the video 
communication capabilities of the first computer and/or 
second computer,- 

15 i) making a connection between the service provider and 
the first computer and/or between the service provider and 
the second computer along the fifth communication path 
and/or the sixth communication path; 

20 j) uploading at least one video image from the video 
input means to the service provider along the fifth 
communication path and/or sixth communication path; and/or 
downloading at least one video image from the service 
provider along the fifth communication path and/or sixth 

25 communication path to be displayed on the respective video 
display unit(s) . 

The term ^telephony device" as used herein means any 
device that can be used for speech telephony and the term 

30 * telephony communications network" means any 
communications network that can be used to carry speech 
telephony signals. In the basic embodiment of the 
invention the telephony device is a simple telephone and 
the telephony communications network is the Public 

35 Switched Telephone Network (PSTN) or the Mobile Network. 
In other embodiments of the invention, the telephony 
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device may be a LAN based telephone or a sound card, 
microphone and headset in a PC, and the telephony 
communications network may be the data network. Also the 
telephony device may even be part of a multimedia 
terminal. However, for the purposes of this document 
-telephone" is used to mean "telephony device" in the 
broadest sense, and likewise 'telephone network" is used 
to mean telephony communications network" in the broadest 
sense. 



Likewise, the placement of the telephone call (routed) 
'via the service provider' is also meant in the broadest 
sense. The service provider may indeed terminate both legs 
(first and second communication paths) of the telephone 
15 call and route the audio through its own equipment. In 
this case the first and second communication paths carry 
call signalling information and also speech. This may be 
the case, for example, when the service provider requires 
the first user to enter caller- id data via their telephone 
20 keypad. Alternatively, the service provider may just have 
knowledge of the telephone call between the first and 
second user. In this case the service provider has 
notification or signalling interfaces to/from the 
telephony communications network and the first and second 
25 communication paths carry only signalling information, and 
the audio media travels directly through the telephone 
communications network between the telephony devices. 
Examples of such signalling interfaces includes computer 
telephony integration (CTI) interfaces to PBX's, and 
interfaces to the Intelligent Network (IN) in the PSTN. 
When the telephony network is an H.323 packet network, 
such a signalling interface can be provided through the 
GateKeeper interface for example. 



35 The term 'multimedia terminal" as used herein means any 
device or collection of devices used for audio video and 
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optionally data cotnmunications . The multimedia terminal 
may be formed from separate audio and video communication 
devices, for example a telephone and a video display unit 
of a personal computer optionally equipped with a camera 
and capture card. Alternatively, the multimedia terminal 
may be formed from the combination of an integrated 
multimedia terminal having both audio and video 
capability, but using only the video function, in 
combination with a separate audio device, for example a 
telephone. Finally the multimedia terminal may be an 
integrated multimedia device. 

The term ''caller-id data" as used herein means any type of 
data or information that serves to identify the first or 
the second user, the association between their telephony 
devices and their computers, and/or their calls and 
connections. Examples of caller-id data include telephone 
numbers, computer network addresses and access codes, 
including security codes and account codes. 



Either user may communicate their telephone numbers and 
their computer network addresses on their computer 
connections (third and fourth communication paths), and 
the service provider may make the association with their 
25 telephones, either by calling the telephones using the 
previously inputted telephone number or by matching the 
telephone number presented by the telephony communications 
network (Calling Line Identifier and Called Party Number 
services) against the previously inputted telephone 
3 0 numbers . 

When the service provider cannot immediately associate a 
user's telephone device and computer and the corresponding 
calls and connections (because, for example the user's 
35 telephone number has not been presented by the network 
and/or there is no computer connection) , the use of access 
codes may serve as caller- id data. The service provider 
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may require the use of access codes for security or 
billing purposes in addition to other forms of caller- id 
data. For example, the service provider may require the 
first user to enter an access code (e.g. account number 
5 and/or PIN) before allowing the telephony call to the 
second user to proceed. The access code entered will 
identify the first user to the service provider and allow 
the correlation with his computer to be made provided his 
computer connection is also made to the service provider. 

10 Another example of caller- id data includes an access code 
- communicated from the service provider to the first user 
along the third communication path as a series of numbers, 
letters or other symbols displayed upon the first computer 
visual display unit. The first user could then communicate 

15 these symbols back to the service provider along the first 
communication path, for example by using a touch tone key 
pad associated with the telephone, or if the service 
provider has voice recognition means, by speaking the 
access code back to the service provider. 

20 

In the basic embodiment of the invention, the first user 
initiates the telephone call to the second user from his 
telephony device. This need not always be the case. The 
invention is also applicable when the second user 

25 initiates the telephone call from his telephony device to 
the first user. Likewise, in a more sophisticated 
embodiment of the invention, the first user may initiate 
the telephone call from his computer along the third 
communication path. Caller- id data may contain his 

30 telephone number as well as the telephone number of the 
second user. The service provider may then place telephone 
calls out to both first and second users along the first 
and second communication paths connecting their telephony 
devices together. 



35 



Because the telephone call to the second user is routed 
via the service provider, association of calls and 
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connections may conveniently take place during the 
initiation of the telephone call, before the call has been 
forwarded to or answered by the second user. Then, by the 
time the second user has answered the telephone call, the 
service provider has already associated the connection to 
the first user's computer with the telephone call. 

Images can then be uploaded from the first user's computer 
to the service provider. 



10 



The caller- id data communicated between the second user 
and the service provider includes caller- id data 
communicated by the first- user to the second user. For 
example, once the call has been answered, the first user 
15 may then communicate verbally to the second user the same 
access code. The second user can then use his computer to 
make a connection to the service provider, enter the 
access code, for example via a computer keyboard, in order 
to be able to download one or more images already uploaded 
20 to the service provider. 

If the caller-id data is sufficient to identify the second 
user and make the correlation between his telephone device 
and computer immediately, without further input from him, 
25 then the seirvice provider can automate the downloading of 
images to his computer once the telephone call has been 
answered . 



30 



Preferably, such video images are automatically downloaded 
at frequent intervals in order to approximate real time 
video. Indeed, it is the function of the audio-video 
service to provide the highest level of video 
communication consistent with the user's con^uter video 
capability. For example, if both first and second user do 
35 indeed have fully functional multimedia terminals then 
real-time video communication may be possible. If one user 
has no camera, for example, but does have a video decoder 
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on his PC, then only one-way video communication may take 
place. 

The video communication capabilities of a user computer 
5 may already be known from subscription and log-in 
parameters or may be determined dynamically through the 
execution of an applet on the user computer that 
communicates with the service provider. The eventual level 
of video communication will also be affected by the action 
10 of any firewall and the amount of bandwidth available. 

In a basic embodiment of the invention, said video images 
are downloaded manually by the first and/or second user, 
for example by activating a refresh or reload function in 

15 web-browser software. In some cases, this may be the 
highest level of video capablity that can be achieved 
because of firewalls or because one user has no video 
camera and/or video codec installed. In this case video is 
sent and/or received on the third and/or fourth 

20 communication paths, that is, the same communication paths 
that are used for data communications between the user's 
computers and the service provider. 

Where firewalls may need to be bypassed and/or more 
25 bandwidth needed for improved video communications, 
additional communication paths, a fifth communication path 
and a sixth path may be provided between the user's 
computer and the service provider. The fifth and sixth 
paths are intrinsically associated with the third and 
30 fourth communication paths. 

Where there are mechanisms for transferring such media 
through firewalls and/or sufficient bandwidth is available 
then the fifth and sixth communication paths may be on the 
35 same or different physical circuits as the third and 
fourth communication paths . 
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Therefore, the system may be capable of permitting the 
service provider to determine the video communication 
capabilities of the first computer and/or second computer 
for example via an applet received from the service 
5 provider by the first or second computers, or through 
subscriber login parameters, in order to provide a higher 
quality of video communication consistent with these 
capabilities and any restrictions imposed by firewalls 
and/or the availability of bandwidth That is, the method 
10 and system according to the invention are consistent with 
- fully functional multimedia terminals when firewalls are 
avoided (for example by connecting via access routers over 
the PSTN or other suitable networks) or where mechanisms 
to route such traffic through firewalls exist. 

15 

In all cases, it is preferred if images can no longer be 
downloaded when the telephone call is ended. This provides 
a degree of security against unauthorised downloading of 
such images . 

20 

In the above example, the caller- id data at all stages is 
the same. This need not be the case, however, and caller- 
id data at each stage can be different as long as the 
service provider is able to make the necessary 
25 correlations. 

In the above description, the initiation of the video 
element of the call is instigated by the first user (i.e. 
the user who has initiated the telephone call) . This need 
3 0 not be the case. The second user or the user receiving the 
telephone call may instead instigate the uploading and 
downloading of video images, particularly if the second 
user has video input equipment . The system and methods 
described above apply equally in this case. 

35 

In a one embodiment of the invention, the connection for 
the first computer and/or the second computer is made 
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through the Internet. In this case, the communication 
means includes web-browser software, the downloading 
and/or the uploading of said image being from/to a web 
page , 

5 

Alternatively, the connection for the first computer 
and/or the second computer may be made directly to the 
service provider through the public switched telephone 
network or other suitable networks, for example via an 

10 access router either connected directly to the computer, 
or via a local area network. Such an access router can 
therefore bypass any firewalls on the local network, but 
may be programmed only to dial into the service provider 
which can then be provided with its own firewall in order 

15 to safeguard the network. Such a direct connection can 
provide the benefit of a higher data rate than may be 
achievable over the Internet, in order to provide quick 
transmission of images, 

20 In a more sophisticated embodiment of the invention, the 
connection for the first computer and/or the second 
computer may use both the Internet and the PSTN - the 
Internet for data communication and the PSTN for video 
communicat ion . 

25 

The invention is also applicable to future networks that 
provided high bandwidth Internet connection, for example 
those using Cable Modem or DSL/ATM technologies. Provided 
mechanisms exist for passing such traffic through 
30 firewalls (if they are present) , these Internet 
connections may be used for both data communication, video 
communication and even audio communication. 

Alternatively, if both parties in a call belong to the 
35 same corporate organisation, the enterprise's Intranet 
and/or LAN made be used for the video communication paths . 
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The telephony communications network will in general 
comprise the Public Switched Telephone Network. The 
telephone network may also or alternatively comprise a 
local private telephone network, for example a PBX 
5 network, or the mobile network. Future telephony 
communications networks may also use the data network such 
as an Intranet or LAN, or future Internet networks. 

In one embodiment of the invention, the caller- id data 
10 between the first user and service provider is 
communicated over both the first communication path and 
the third communication path to the service provider, 
rather than from the service provider. 

15 Alternatively i the caller- id data between the first user 
and service provider may be communicated over the third 
communication path from the service provider to the first 
user, and over the first communication path from the first 
user to the service provider. 

20 

In a preferred embodiment of the invention, the service 
provider receives via the telephone network caller-id data 
including the telephone number of the first telephone. 
Such caller-id data may be typed into a touch tone 
25 telephone by the first user, but most conveniently this 
caller- id data is generated automatically by the telephone 
network when the first user makes the telephone call. 

Alternatively, the service provider may call the first 
30 telephony device using information supplied by the first 
user over the third communication path or from previously 
supplied subscriber or log-in information. The service 
provider may additionally call the second user when the 
information supplied by the first user over the third 
35 communication path also includes the telephone number of 
the second user. 
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Similarly, the caller- id data cotnmunicated between the 
second user and the service provider may include the 
telephone number of the second telephone. 

If the service provider comprises computerised voice 
recognition means, at least some of such caller- id data 
may be communicated to the service provider by voice. 
Alternatively or additionally, if a telephone has a touch 
tone keypad and the service provider has tone recognition 
means, this may be used to communicate at least some of 
the caller- id data to the service provider. 

When the data input means comprises a keyboard or a 
computer mouse, at least some of such caller- id data may 
15 be communicated to the service provider with the keyboard 
or mouse. 

The invention will now be described by way of example with 
reference to the accompanying drawings, in which: 

0 

Figure 1 is a schematic block diagram of a preferred 
embodiment of an audio and video communication system 
according to the invention; and 

5 Figure 2 is a flow diagram explaining a method of 

making an audio-video telephone call according to the 
invention. 

In Figure 1, an audio and video communications system 1 
0 has at a first location 100 one or more personal computers 
102, each comprising a visual display unit, or monitor, 
104 and data input means in the form of a keyboard 106 and 
a mouse 108. Seated near the computer 102, and most 
conveniently atop the monitor 104, is a conventional video 
5 camera 110 by which one or more video images may be 
captured by the computer. Each computer 102 may be 
connected 112 to a local area network (LAN) 114, however, 
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the invention is not limited to computers on LANs and is 
applicable also to stand-alone computers. 

Each computer 102 may via the LAN 114 connect 116 to the 
5 internet 2 through a conventional firewall 118 and router 
120. In the case of a stand-alone computer, the connection 
116 may be via a modem 117, shown in dashed outline. 

Conventional web-browser software and/or client software 
10 running in the personal computer 102 permits the user of 
- the computer to connect 124 via the Internet 2 to an 
audio -video service provider 6 along a path referred to 
herein as a third communication path. 

15 Also at the first location is at least one telephone 122 
with a touch tone key pad 123 in proximity 126 with the 
personal computer 102. The telephone 122 may be any type 
of telephone, including a fixed telephone connected 130 to 
a telephone network, for example the public switched 

20 telephone network (PSTN) 8, either directly or through a 
local private telephone network, for example a private 
branch exchange (PBX) 128, or a mobile (i.e. cellular) 
telephone connected to the PSTN 8 via a wireless network 
or a fixed telephone connected to a data network (not 

25 shown) . The user of the telephone 122 may then dial 
through 132 to the audio-video service provider 6 via the 
PSTN 8 along a path referred to herein as a first 
communication path. 

30 The computer 102 and telephone 122 are in proximity 126 
with each other in the sense that a user of the telephone 
may at some point in the process or setting up or having 
an audio-video telephone call, operate the data input 
means 106,108, view the monitor 104, or have his or some 

35 other image captured by the video camera 110. 
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The data rate achievable between the computer 102 and the 
audio- video service provider 6 through the Internet 2 may. 
be relatively low for the transmission of video images. 
This may not be a problem for the transmission of just one 
5 still image, or a series of separate still images. If 
greater data rate is desired, then a connection 134,136 
may be made through the PSTN 8, or other suitable 
networks, to the audio-video service provider 6 via a 
conventional access router 138. If this way of connecting 
10 bypasses conventional firewalls, the access router 138 
should then be limited to connecting through to secure 
sites such as the service provider 8, which would then 
normally be required to have their own internal firewall 
10. The connection via the access router 138 is therefore 
15 suitable if a plurality of video images are to be 
automatically uploaded (or downloaded) at frequent 
intervals in order to approximate real time video. In the 
future, users will have access to high-speed Internet 
access suitable for the transmission of a plurality of 
20 images. In this case, and if mechanisms exist to pass such 
traffic through firewalls (if present) and Internet 
routers, then data access to the service provider can be 
solely via the Internet 2 . 

25 The connection through the access router may be the 
initial connection made between the user and the service 
provider 6, in which case this connection is made along 
the third communication path. It is possible however, that 
this connection is made subsequent to a connection made 

30 through the Internet 2, in which case this connection is 
made along a path referred to herein as a fifth 
communication path. For example, the service provider 6 
may send an applet to the user's computer 102 in order to 
determine the video capability of the computer. If the 

35 service provider 6 is aware of the connection to the 
computer 102 via the access router 138, then the service 
provider can dial up an additional communications channel 
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to the computer 102 via the access router 138. Video 
images may then be uploaded (or downloaded) from the 
service provider via the access router 138. Where, high- 
speed internet access is available, the fifth and/or sixth 
5 communication path may take the same physical route as the 
third and/or fourth communication paths respectively. 

The arrangement described above allows a user at the first 
location 100 to make an audio-video telephone call to a 

10 second user at a second location 200. The second location 
200 has telephone and computer apparatus as described 
above for the first location 100. The telephone and 
computer apparatus and connections associated with the 
second location 200 are therefore labelled with similar 

15 reference numeral incremented by 100. 

The telephone and computer apparatus may connect to the 
audio-video service provider 6 in a similar manner to that 
described above for the first location 100, along paths 

20 referred to herein respectively as second and fourth 
communication paths. If the second computer makes a 
subsequent connection with the service provider. for 
example through the access router 238, then this 
connection is via a path referred to herein as a sixth 

25 communication path. 

The second location, however, does not need to have a 
video camera 210, which is therefore drawn in phantom. If 
the second location does have a video camera 210, then it 
30 becomes possible for the first and second users both to 
receive video images from the other. 

Reference is now made also to Figure 2, which shows the 
process involved in making an audio-video telephone call 
35 between the first location 100 and the second location 
200. The first user begins the process by using his 
telephone 122 to call 50 the service provider and to use 
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his computer 102 and web-browser software to make a 
connection 52 to the service provider 6. At this stage, 
the service provider has no way of correlating the 
telephone call with the computer connection. 

5 

There are many different ways in which caller- id data may 
be exchanged between the first user and the service 
provider. The flow chart of Figure 2 relates just to one 
possible way of making the correlation. 

10 

The service provider 6 sends 5.4 a web page to the first 
user's computer asking him to enter from his computer, for 
example by typing information into his keyboard 106, the 
phone number of the second user. Additionally or 
15 alternatively, the user will hear 56 a recorded voice 
message from the service provider 6 asking him to key into 
his telephone the telephone number of the second user. 

Optionally, the process may involve the exchange of a 
20 security code, or an account code. Such a code may also be 
associated with billing of the first user for the service 
provided by the service provider, in which case, the first 
user may enter 58 via his computer a unique code 
identifying the user. If the same code is entered 60 by 
25 telephone, then it may not be necessary to enter the 
telephone number also via the computer. 

The service provider 6 may be handling simultaneously many 
different calls and computer connections, and so the 
30 service provider waits 62 until it is able to make a match 
between the caller- id data associated with the connection 
and call with the first user. When a match is made, the 
service provider 6 is able to correlate the call the with 
the connection. 

35 

At this point, the user may hear 64 on his telephone a 
recorded voice saying the call to the second user will now 
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be placed. At about the same time, the service provider 6 
sends to the first computer 102 an applet so that it can 
determine the video conferencing capabilities of the first 
computer and if necessary have the first computer upload 
5 66 one or more video images to the service provider 6 from 
the video camera 110 . 

When the second user answers the call at his telephone 
222, he is told by the first user that he may make a 

10 connection from his computer 202 to the service provider 6 
in order to download one or more of the video images that 
have or will be uploaded from the first user to the 
service provider 6. The second user then uses 70 his 
computer 202 and web-browser software to make a connection 

15 to the service provider. 

At this stage, the service provider has no way of 
correlating the telephone call with the computer 
connection to the second computer 202. Again, there are 
20 many different ways in which caller-id data may be 
exchanged between the second user and the service 
provider. It is, however, necessary for at least some of 
the exchange with the second user to be with the second 
user's computer 202. 

25 

One way in which caller- id data may be exchanged is if the 
service provider sends 72 a web page to the second user's 
computer 202 asking the second user to enter his telephone 
number. Optionally, the second user may also be asked for 

30 a security or access code. This code could be the same 
code originally used by the first user, in which case the 
first user will have communicated this to the second user. 
However, if the code is associated with billing of the 
first user, it may be desirable for the second user to be 

35 supplied with a different code. 
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The service provider waits 74 until it is able to make a 
match between the caller-id data associated with the 
connection to the second computer 202 and the call now set 
up between the first and second users. When a match is 
5 made, the service provider 5 is able to correlate the call 
with connection to the second user. 

The service provider 6 then sends to the second computer 
202 an applet to determine the video conferencing 

10 capabilities of the second computer and so that the second 
computer is able to download 76. automatically, for example 
from a web page to which the second user has been granted 
access, one or more of the video images from the service 
provider 6. When the call is ended, the service provider 

15 terminates the uploading and downloading of images. 

It is not, however, necessary for the second user's 
computer 202 to receive an applet, as the second user may 
be granted access to web pages, so that he may manually 
20 download one or more images. Alternatively for fully 
functional video conferencing computers, proper video 
communication whether via the service provider or direct 
may be established if firewalls and bandwidth allow. 

25 There are many alternative possibilities for the exchange 
of caller- id data and the correlation of telephone call 
with a computer connection. 

For example, it is not necessary for the caller- id data to 
30 comprise the telephone number of the second user. The 
service provider 6 may send to the first user's computer 
102 a security code or uniquely generated access code, 
which the user then types into his telephone keypad after 
he has called the service provider. 



35 



Many telephone network automatically transmit the 
telephone number of the person making the call. This 



wo 00/45579 PCT/GBOO/00203 

- 23 - 

number may be intercepted by the service provider, and 
form part of the caller- id data. In this case it may not 
be necessary for the first user to enter any other caller- 
id data via his telephone, but just to supply additional 
5 caller- id data to the service provider from his computer, 
for example, by entering his telephone number from his 
computer . 

Another method, is for the service provider to have many 
10 telephone numbers, reserve one for this call and prompt 
the user via a web page to call this reserved number. 
Conversely, the service provider may in fact call the 
first user's telephone after he has entered its telephone 
number (or confirmed it) via a web page. 

15 

Similarly, since the service provider is supplied with the 
telephone number of the second user at the time the call 
is made, the caller- id data communicated between the 
second user and the service provider may include the 
20 telephone number of the second telephone. 

If the second user also has a video camera 210, then each 
computer may alternate between uploading and downloading 
images to each computer, so that both the first and second 

25 users may view one or more images from the other user's 
video camera. Indeed, if both user's computers also have 
video codecs, then the service provider can facilitate a 
better quality of service approaching real-time video 
communication subject to the available bandwidth and 

30 firewall restrictions. 

The audio and video communications system described above 
permits an audio-video call to be placed without the need 
for fully functioning multimedia terminals at each end of 
35 the call, though it can be consistently applied to such 
fully functional terminals. The system takes advantage of 
the common arrangement of a stand-alone or networked 
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personal computer in proximity with a conventional 
telephone on a different network. Voice communication may 
take place over standard telephone lines, whilst one or 
more video images may be received by at least one of the 
5 parties to the call. The invention may also help provide a 
level of audio and video communications consistent with 
the capabilities of the user's terminals and the networks 
to which they are connected. The system therefore 
facilitates the adoption of audio-video telephony. 
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Claims 

1, An audio and video communication system (1), 
comprising: a first audio telephony device (122) and a 

5 second audio telephony device (222) that may communicate 
with each other over a telephony communications network 
(8) ; an audio-video service provider (6) that may 
communicate over said telephony communications network (8) 
with both the first and second telephony devices (122,222) 

10 respectively along a first communication path (130,132) 
' and a second communication path (230,232); a first 
computer (102) and a second computer (202), said computers 
(102,202) being in proximity with respectively the first 
telephony device (122) and the second telephony device 

15 (222), at least one of the computers (102,202) having a 
video input means (110) and the first computer (102) and 
second computer (202) each having a data input means 
(106,108;206,208) , a visual display unit (104,204), and 
communication means (112, 117;212, 217) by which the first 

20 computer (102) and second computer (202) may make a 
connection to the service provider (6) respectively along 
a third communication path (116,124) and a fourth 
communication path (216,224); in which the first telephony 
device (122) and first computer (102) are operable by a 

25 first user and the second telephony device (222) and 
second computer (202) are operable by a second user, 
wherein the system is capable of : 

i) permitting the first user to use the first computer 
30 (102) to make a first connection to the service provider 

(6) ; 

ii) permitting the initiation of an audio telephony call 
between the first telephony device (122) and the second 

35 telephony device (222) via the service provider (6) along 
the first and second communication paths 
(130,132/230,232) ; 
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iii) cotnmunicating caller- id data between the first user 
and the service provider (6) along the first communication 
path (130,132) and/or the third communication path 

5 (116,124) so that the service provider (6) can correlate 
both the call and the first connection with each other ,- 

iv) uploading at least one video image from the video 
input means (110) to the service provider (6) ; 

10 

V) permitting the second user . to use the second computer 
(202) to make a second connection to the service provider 
(6) ; 

15 vi) communicating caller- id data between the second user 
and the service provider (6) along at least the fourth 
communication path (216,224) so that the service provider 
(6) can correlate both the call and the second connection 
with each other; 

20 

vii) downloading said video image (s) from the service 
provider (6) to the first computer (102) and/or second 
computer (202) to be displayed on the respective video 
display unit{s) (104,204) when the second user has 
25 answered said telephone call and when both the first and 
second connections are correlated with the call. 

2. A method of making an audio-video telephone call 
using an audio and video communication system (1) that 

30 comprises: a first telephony device (122) and a second 
telephony device (222) that may communicate with each 
other over a telephony communications network (8) ; an 
audio-video service provider (6) that may communicate over 
said telephony communications network (8) with both the 

35 first and second telephony devices (122,222) respectively 
along a first communication path (130,132) and a second 
communication path (230,232); a first computer (102) and a 
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second computer (202), said computers (102,202) being in 
proximity with respectively the first telephony device 
(122) and the second telephony device (222) , at least one 
of the computers (102,202) having a video input means 
5 (110) and the first computer (102) and second computer 
(202) each having a data input means (106, 108/206,208) , a 
visual display unit (104,204), and communication means 
(112,117;212,217) by which the first computer (102) and 
second computer (202) may make a connection to the service 

10 provider (6) respectively along a third communication path 
(116,124) and a fourth communication path (216,224); in 
which the first telephony device (122) and first computer 
(102) are operable by a first user and the second 
telephony device (222) and second computer (202) are 

15 operable by a second user, wherein the method comprises 
the steps of: 

a) using the first computer communication means 
(112,117) to make a first connection to the service 

20 provider (6) ; 

b) initiating an audio telephone call between the first 
telephony device (122) and the second telephony device 
(222) via the service provider (6) and the first and 

25 second communication paths (130 , 132 , -230 , 232) ; 

c) communicating caller-id data between the first user 
and the service provider (6) along the first communication 
path (130,132) and/or the third communication path 

30 (116,124) so that the service provider (6) can correlate 
both the call and the first connection with each other; 

d) uploading at least one video image from the video 
input means (110) to the service provider (6) ; 

35 
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e) using the second computer communication means 
(212,217) to make a second connection to the service 
provider (6) ; 

5 f) communicating caller- id data between the second user 
and the service provider (6) along at least the fourth 
coitimunication path (216,224) so that the service provider 
(6) can correlate both the call and the second connection 
with each other; 

10 

g) downloading said video image (s) from the service 
provider (6) to the first computer (102) and/or second 
computer (202) to be displayed on the respective video 
display unit(s) (104,204) when the second user has 
15 answered said telephone call and when both the first and 
second connections are correlated with the call. 

3. A method of making an audio-video telephone call as 
claimed in Claim 2, in which the video image (s) is/are 

20 uploaded and/or downloaded via the third communication 
path (116,124) and/or the fourth communication path 
(216,224) . 

4. A method of making an audio-video telephone call as 
25 claimed in Claim 2, in which the communication means 

(112, 117; 212/217) includes means by which the first 
computer (102) and second computer (202) may make a 
connection to the service provider (6) respectively along 
a fifth communication path (134,136) and a sixth 
30 communication path (234,236), wherein the method comprises 
the steps of: 

h) communicating to the service provider (6) the video 
communication capabilities of the first computer (102) 
35 and/or second computer (202) ; 
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i) making a connection between the service provider (6) 
and the first computer (102) and/or between the service 
provider (6) and the second computer (202) along the fifth 
communication path (134,136) and/or the sixth 
5 communication path (234,236); 

j) uploading at least one video image from the video 
input means (110) to the service provider (6) along the 
fifth communication path (134,136) and/or sixth 
10 communication path (234,236); and/or downloading at least 
one video image from the servipe provider (6) along the 
fifth communication (134,136) path and/or sixth 
communication (234.236) path to be displayed on the 
respective video display unit(s) (104,204) . 



15 



20 



25 



5. A method of making an audio-video telephone call as 
claimed in any of Claims 2 to 4, in which the 
communication means for the first computer (102) and/or 
the second computer (202) comprises a modem (117,217). 

6. A method of making an audio-video telephone call as 
claimed in any of Claims 2 to 5, in which said connection 
for the first computer (102) and/or the second computer 
(202) is made through the Internet (2) . 



7. A method of making an audio-video telephone call as 
claimed in Claim 6, in which the communication means 
includes web-browser or other client software, the 
downloading and/or the uploading of said image being 

30 from/to a web page. 

8. A method of making an audio-video telephone call as 
claimed in any of Claims 2 to 7, in which said connection 
for the first computer (102) and/or the second computer 

35 (202) is made directly to the service provider (6) through 
the telephony network (8) . 
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9. A method of making an audio-video telephone call as 
claimed in any of Claims 2 to 8, in which said telephony 
network comprises the public switched telephone network 
(8). 

5 

10. A method of making an audio-video telephone call as 
claimed in any of Claims 2 to 9, in which said telephony 
network comprises a local private telephone network 
(128,228) . 

10 

11. A method of making an audio-video telephone call as 
claimed in any of Claims 2 to 10, in which the caller- id 
data between the first user and service provider (6) is 
communicated over both the first communication path 

15 (130; 132) and the third communication path (230,232) to 
the seorvice provider (6) . 

12 . A method of making an audio-video telephone call as 
claimed in any of Claims 2 to 10, in which the caller- id 

20 data between the first user and service provider (6) is 
communicated: over the third communication path (116,124) 
from the service provider (6) to the first user; and over 
the first communication path (130,132) from the first user 
to the service provider (6) , 

25 

13 . A method of making an audio-video telephone call as 
claimed in any of Claims 2 to 12, in which the service 
provider (6) receives via the telephony network (8) 
caller- id data including the telephone number of the first 

30 telephony device (122) . 

14. A method of making an audio-video telephone call as 
claimed in Claim 13, in which said caller-id data received 
by the service provider (6) is generated automatically by 

35 the telephony network (8) . 
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15. A method of making an audio-video telephone call as 
claimed in any of Claims 2 to 14, in which the caller- id 
data communicated between the second user and the service 
provider (6) includes the telephone number of the second 

5 telephony device (222) . 

16. A method of making an audio-video telephone call as 
claimed in any of Claims 2 to 15, in which the caller- id 
data communicated between the second user and the service 

10 provider (6) includes caller- id data communicated by the 
first user to the second user. , 

17. A method of making an ' audio-video telephone call as 
claimed in any of Claims 2 to 16, in which the data input 

15 means comprises a keyboard (106,206), said keyboard being 
used to communicate at least some of the caller- id data. 

18. A method of making an audio-video telephone call as 
claimed in any of Claims 2 to 17, in which the data input 

20 means comprises a computer mouse (108,208), said mouse 
being used to communicate at least some of the caller- id 
data . 

19 . A method of making an audio-video telephone call as 
25 claimed in any of Claims 2 to 18, in which the first 

telephony device (122) and/or the second telephony device 
(222) has a touch tone key pad (123,223), said key pad 
being used to communicate at least some of the caller- id 
data. 

30 

20. A method of making an audio-video telephone call as 
claimed in any of Claims 2 to 19, in which the service 
provider (6) comprises computerised voice and/or tone 
recognition means, the first user and/or the second user 

35 communicating to the service provider (6) at least some 
the caller- id data by voice and/or tones. 
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21. A method of making an audio-video telephone call as 
claimed in any of Claims 2 to 20, in which a plurality of 
said video images are automatically downloaded at frequent 
intervals in order to approximate real time video. 

5 

22. A method of making an audio-video telephone call as 
claimed in any of Claims 2 to 20, in which said video 
images are downloaded manually by the second user. 

10 23. A method of making an audio-video telephone call as 
claimed in any of Claims 2 to .22, in which images can no 
longer be downloaded when the telephone call is ended. 

24 . A method of making an audio- video telephone call as 
claimed in any of Claims 2 to 23, in which the first user 

15 initiates the telephone call from the first telephony 
device (122) along the first communication path (130,132). 

25 . A method of making an audio- video telephone call as 
claimed in any of Claims 2 to 23, in which the second user 
initiates the telephone call from the second telephony 

20 device (222) along the second communication path 
(230,232) . 

26 . A method of making an audio-video telephone call as 
claimed in any of Claims 2 to 23, in which the first user 
initiates the telephone call from the first computer (102) 

25 along the third communication path (116,124). 
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First user uses his telephone to 
dial into the service provider 



First user uses his computer and 

web-browser to make a 
rnnnectiori tf^a service provider 



Service provider recorded voice 
asks first user to use his phone 
keypad to enter the phone number 
of the second user 
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Optionally, recorded voice asks | i O 
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Service provider sends web 
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enter from his computer the 
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Service provider waits until data entered by first user via his telephone and 
computer enables sen/ice provider to correlate the call and the computer 

connection with the first user 



Service provider recorded 
voice tells first user that call 
will now be placed 
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Service provider sends to first 
user's computer an applet to 
upload video images 
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Second user answers call, 
and is told by first user that 
he may use his computer to 
download images 



Second user uses his computer 
and web-browser to make a 
connection to the service 
provider 
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Service provider sends web 
page to second user asking him 
to enter from his computer his 
phone number (& optionally 
security code) 
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Service provider waits until data entered by second user via his computer 
enables service provider to correlate the call and the computer connection 

with the second user 
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Service provider sends to secondK 
user's computer an applet to 
download video images until 
such time as the call is ended 
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